Generalization in Reinforcement Learning with a Task-Related World Description using Rules

نویسندگان

Alejandro Agostini

Enric Celaya

چکیده

A Reinforcement Learning problem is formulated as trying to find the action policy that maximizes the accumulated reward received by the agent through time. One of the most popular algorithms used in RL is QLearning which uses an action-value function q(s,a) to evaluate the expectation of the maximum future cumulative reward that will be obtained from executing action a in situation s. Q-Learning, as well as conventional RL techniques, is defined for discrete environments with a finite set of states and actions. The action-value function is explicitly represented by storing values for each state-action (s,a) pair. In order to reach a good approximation of the value function all the (s,a) pairs must be experienced many times but in practical applications the amount of experience for learning to take place is unfeasible. Therefore, the value function must be generalized to infer in situations never experienced so far. The generalization problem has been widely treated in the field of machine learning. Supervised learning directly treats this issue and many generalization techniques have been developed in this field. Any of the representations used in supervised learning could, in principle, be applied to RL. But there are some important issues to take into account that make good generalization in RL very hard to achieve. One of the most remarkable is that the value function is learned while represented. In this work we propose a RL approach that uses a new function representation of the Q function that allows good generalization by capturing function regularities into decision rules. The representation is a kind of Decision List where each rule configures a subspace of the state-action space and provides an approximation of the Q function in its covered region. Rule selection for action evaluation is given by the rule with both, good accuracy in the estimation and high confidence in the related statistics.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Effect of Different Task Types on Learning Prepositions in Form–Focused and Meaning–Focused Interaction Enhancement-Based Classes

The current study examines the impact of different task types on learning prepositions in form and meaning- focused interaction enhancement- based classes. The participants were 57 second Year University students enrolled in three intact lab classes at Tabriz Islamic Azad University. The first group was provided with form-focused interaction enhancement, the second with the meaning-focused int...

متن کامل

A Q-learning Based Continuous Tuning of Fuzzy Wall Tracking

A simple easy to implement algorithm is proposed to address wall tracking task of an autonomous robot. The robot should navigate in unknown environments, find the nearest wall, and track it solely based on locally sensed data. The proposed method benefits from coupling fuzzy logic and Q-learning to meet requirements of autonomous navigations. Fuzzy if-then rules provide a reliable decision maki...

متن کامل

An Adaptive Learning Game for Autistic Children using Reinforcement Learning and Fuzzy Logic

This paper, presents an adapted serious game for rating social ability in children with autism spectrum disorder (ASD). The required measurements are obtained by challenges of the proposed serious game. The proposed serious game uses reinforcement learning concepts for being adaptive. It is based on fuzzy logic to evaluate the social ability level of the children with ASD. The game adapts itsel...

متن کامل

RoboCup Agent Learning from Observations with Hierarchical Multiple Decision Trees

It is a dif£cult task to hand-code optimal condition-action rules for software agents. A solution to this is reinforcement learning. In reinforcement learning, agents acquire the condition-action rules by learning from their experiences. However, acquisition of complicated rules might take a great amount of learning time and learning might not converge. To solve these drawbacks, an approach cal...

متن کامل

Robot Task Learning based on Reinforcement Learning in Virtual Space

− As a novel learning method, reinforced learning by which a robot acquires control rules through trial and error has gotten a lot of attention. However, it is quite difficult for robots to acquire control rules by reinforcement learning in real space because many learning trials are needed to achieve the control rules; the robot itself may lose control, or there may be safety problems with the...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2009

Generalization in Reinforcement Learning with a Task-Related World Description using Rules

نویسندگان

چکیده

منابع مشابه

The Effect of Different Task Types on Learning Prepositions in Form–Focused and Meaning–Focused Interaction Enhancement-Based Classes

A Q-learning Based Continuous Tuning of Fuzzy Wall Tracking

An Adaptive Learning Game for Autistic Children using Reinforcement Learning and Fuzzy Logic

RoboCup Agent Learning from Observations with Hierarchical Multiple Decision Trees

Robot Task Learning based on Reinforcement Learning in Virtual Space

عنوان ژورنال:

اشتراک گذاری